A Travel Data Collection Method Based on Big-data from a Smart Phone APP
-
摘要: 城市居民的交通出行是交通研究中的基础问题.结合当前手机应用的特点,以手机应用与无线通信网络数据交互时产生的 CI(Cell-ID Identify)定位数据为研究对象来获取居民出行数据.采集了“易信”手机应用2013与2014年共3241238条有效的 CI 定位数据,并对其进行了降维、离散化、去噪预处理.提出将定位数据标准化为0-1数据矩阵,以矩阵运算的形式推导用户出行 OD 矩阵、各小区出入流量等二次数据的算法模型,并在 Matlab 中进行了实现.结果表明,相较于遍历数据的循环算法,推导效率有显著提升.提出了推导 OD 矩阵的完整性与真实性评价指标 R ,并计算得到2013的 R 为19.1%,2014为69.3%.发现手机应用日均数据量较大的 CI 数据具有更高的完整性与真实性(2013年为10.6条,2014年为47.4条),但该指标主要反映所有小区中存在交通出行的整体情况,对各小区的出行行为的完整性与真实性尚需做进一步研究.Abstract: travel of Urban residents is one of the fundamental questions in transportation research.According to the characteristics of the current mobile phone application,this paper focuses on extracting residents'travel data from CI (Cell-ID Identify)location data generated during the interaction between mobile phone application and wireless communi-cation network.A total of 3 241 238 CI positioning data in 2013 and 2014 has been collected from"Yi Xin"App,which is then preprocessed through dimensionality reduction,discretization and de-noising.This paper develops an efficient matrix operation algorithm to extract origin/destination information and inbound and outbound traffic flow data of each residen-tial area based on the conversion of CI positioning data into standardized 0-1 matrices.Matlab is used to implement this algorithm.The results show that,comparing to circulation algorithm of data traversal search,this algorithm can achieve a higher efficiency which shorten the time from days to minutes.An index R has been presented to evaluate the integrity and authenticity of the derivate users′OD matrices.The index R of OD matrices is 1 9.1% in 2013,and 69.3% in 2014. The results indicate that the CI positioning data with higher daily data amount (10.6 in 2013 and 47.4 in 2014)has higher integrity and authenticity.However,this index only reflects the overall condition of trips from all residential areas,and the fact if this index can fully and truthfully represent travel behavior of each individual traffic analysis zone should be fur-ther studied.
点击查看大图
计量
- 文章访问数: 355
- HTML全文浏览量: 30
- PDF下载量: 0
- 被引次数: 0